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Introduction. Recent work -and a remark dropped casually by my statistical 
colleague, Albyn Jones—has led me to the speculative perception that “the 
quantum world, under certain circumstances, appears classical” for ultimately 
the same reason that experiments repeated many times over can be expected to 
yield results of high accuracy, and that the statistical properties of thermalizecl 
systems are so sharp as to become susceptible to analysis by the methods of 
thermodynamics. 

The classical theory of errors is dominated by the normal distribution for 
reasons rooted in the central limit theorem. The train of thought to which I 
have alluded leads me to contemplate the existence of a quantum analog of the 
central limit theorem, phrased in terms not of probability distributions but of 
probability amplitudes or—equivalently but (as I will argue) more naturally— 
Wigner distributions. 

My purpose here is simply to collect together, for the convenience of future 
reference, material pertaining to the “Gaussian quantum mechanics” which will 
be central to any effort to put meat on the bare bones of my present intuition. 
The rudiments of this subject are, of course, treated in every introductory 
quantum text , 1 but closer examination turns up a number of subtleties and 
complications, and exposes a variety of methodological options, to which I will 
draw attention. I borrow freely from some informal notes 2 which were written 
in support of the thesis research of a former student 3 and inspired originally 
by quite a different train of thought. Many of the results reported in the latter 
sections of the essay are, so far as I am aware, new. 

I work mainly in one-dimension, and will borrow mainly from the quantum 
physics of free particles to lend dynamical substance to my remarks. 

1 See, for example, David Griffiths, Introduction to Quantum Mechanics 
( 1 . 995 ), Problems 2.22 (p. 50) and 2.40 (p. 69), and §3.4.2 (p. 111). 

2 “Classical motion of quantum Gaussians,” ( 1991 ). 

3 Rodney Yoder, The Phase Space Formulation of Quantum Mechanics and 
the Problem of Negative Probabilities (Reed College thesis, 1992 ). 
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Gaussian wavepackets 


1. Initial construction of the Gaussian packet. To express the circumstance that 
“a;-measurement (performed at time t = 0 with an instrument of imperfect 
resolution) has shown the particle to reside in the vicinity of the point x = a” 
we write 


P(x,0) = |V>(aa, 0) | 2 


some properly positioned and 
shaped distribution function 


and notice that such a statement supplies only limited information about the 
structure of ip{x, 0 ) itself: 

if>(x, 0 ) = \JP(x , 0 ) • : phase factor remains at present arbitrary 


The phase factor has entered with simple innocence upon the stage, but is 
destined to play a leading role as the drama unfolds. 

Whether we proceed from some tentative sense of the operating character¬ 
istics of instruments of finite resolution or seek only to model such statements 
in a concrete but analytically tractable way, it becomes fairly natural to look 
to the special case 

p ( x >°) = ^k e_i[£ ^ 12 W 

The Gaussian on the right defines the “normal distribution” with 

mean : (x) = a 

variance = (uncertainty ) 2 : ((a: — a) 2 ) = a 2 
and the associated wave function reads 

*,°) = [^] h e~i^ 2 • e^’°> (2) 


I write if(x, 0 ) = (a)^)o to draw attention to the fact that we have worked 
thus far in the ^-representation. Passage to the momentum representation is 
accomplished 

ip(p, o) = (p|V0o = J (p\ x ) dx OlVOo 

= -JK f e~* px Hx,0)dx (3) 

but cannot be carried out in detail until the phase factor has been specified . 4 
If, in the Gaussian case (2), we set a = 0 then (3) gives 

ip(p, 0) = [ Av /^ ] 2 e" 1 ^^ • e~^ ap with \ = H/2a (4) 

4 The statement 

normalization of P(x) = \i/j(x)\ 2 => normalization of Q(p ) = \ip(p)\ 2 

is, however, phase-insensitive, and is the upshot of Parseval’s theorem: see 
P. Morse & H. Feshbach, Methods of Theoretical Physics ( 1953 ), p. 456. 



Standing motion 
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whence 


W) = x ^- |[5]2 

which is again normal, but centered at the origin of p-space: 


(p) = 0 

<(p-0) 2 } = A 2 


( 5 ) 


In 

crA = Ax ■ A p = Afi (6) 

we have encountered an instance of optimal compliance with the Heisenberg 
uncertainty principle: Ax • A p > Aft. 

To achieve arbitrary placement of the origin of the normal distribution in 
momentum space—i.e., to achieve 


( 5 ) 




—it might appear most natural in place of (4) simply to write 


<p(p, 0) 



I 2 ^] 


e -s<*(p-6) 


But then 

V’OdO) = J e + ^ px p{p,0)dp (7) 

is, according to Mathematica, a mess; to achieve a simpler result I tentatively 
omit the phase factor, writing 


, _ 1 [ P~b i2 

: 6 4i X J 


} dp 


«*•<» = 75/' + '"{[a73;]A 

~ [ crv/ 2 ir ] 2 ■ e + n bx with (7 = H/2X 
The resulting distribution function 

jP ( ;r ’ 0 ) = T73^ e 


l ! 


m 2 


is (compare (5)) is again normal but centered at the origin in configuration 
space; it is precisely (1) with a = 0. These simple results indicate that if 
we desire to displace P(x,0) and Q(p, 0) simultaneously from their respective 
origins we are going to have to learn how to “manage the mess.” They bear 
directly upon the problem of the “launched Gaussian wavepacket,” to which I 
turn in §5. 

2. Standing motion of a Gaussian wavepacket. Dynamical evolution 


W) 


free particle 


t 
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is particularly easy to describe in the momentum representation (essentially 
because p and H = 5 ^ 7 p 2 trivially commute); we have 

V(p,t)= / (p|e-^( pV 2 "^) «| g ) y( g , Q) dg 

= e -i(? 2 /2 ™) t 5{q-p) 

= ip{p, 0 )-e-* (p2/2m)t ( 8 ) 

The f-dependence of tp(p, t) is so simple as to imply 

Q(p,t) = Q(p, 0) (9) 

Because the particle has been assume to move freely, almost nothing is going 
on in momentum space... but enough (see again ( 8 )) to cause interesting things 
to happen in configuration space: 

Working from 


with 


^{x,t) = -+= J e + * px <p(p,t)dp 

<pM = {[ X7d le " 


( 10 ) 


i[£i 2 
4LaJ . e 


■} 


0 -H(P 2 / 2m ) 4 


r^_l SpvnL [l+.(t/r)]p 3 ] -jap 

Lav^F-I p l 4A2 J 


NOTE : We have introduced here the 
“natural time” r = fim/2A 2 = 2ma 2 /H 


we find 

1 

'Kx,t)= [ g[ 1 + . t(t / r)]v ^ ] 2 exp' 

It becomes natural at this point to define 


1 ( x-a ) 2 \ 
4 a 2 [l+i(t/r)] J 


( 11 ) 


a(t) = cr\/l + (t/r ) 2 ( 12 ) 

Then a[l + i(t/T)\ = a(t)e l * rctan (t/r) anc j = [1 -i(t/T)\/a 2 {t), so we 

have 

P (x,t) = mx,t )\ 2 = -^L_exp{-1[^] 2 } (13) 

which is normal, with fixed mean ((x) = a: all t) and growing variance. 

That the t()(x,t) of (11) is in fact a solution of the Schrodinger equation is 
confirmed by calculation; Mathematica informs us that 

= (fir — 2ma 2 ) -(complicated factor) • 


0 by definition of r 



Relationship to the propagator 
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3. Relationship to the free particle propagator. It follows also from the definition 
t = 2ma 2 /h that 

lim a 2 [1 + i(t/r)\ = iht/2m 
<y |o 

so, working from ( 11 ), we have 


lim , 0 (a;, t) = \fa 1 2m 
& j.0 


ihtV2 tv 


f i m(x—a) 2 \ 

\h 2 1 J 


ex P \h 

m (x — a) 


(14) 


K(x, t ; a, 0 ) : the free particle propagator! 

We are inspired by this development to notice that 

S(x, t; a, 0) = f | — \ } (15) 

I 

= in the limit er J, 0 

is (according to Mathematics ) a solution of the Hamilton-Jacobi equation 

^(ff ) 2 + ff =0 (16) 


and gives 

d 2 S _ n 

dxda 2icr 2 [l-\-i(t /r)] 

I 

= — ™ in the limit a J, 0 


so ( 11 ) can be written 


V»(®,*) = V / W^ • \fl£k e ’ iS{X ' t '' afi) ( 17 ) 

which is exact, but serves in the limit to cast useful light on the origin of 
(14). The factor \J ^ g x g a originates in early work of J. H. Van Vleck , 5 

and has for more than half a century (i.e., since the invention of the Feynman 
formalism 6 ) stood guard at the portal through which quantum and classical 
mechanics communicate. But it should, in this connection, be noticed that 

5 “The correspondence principle in the statistical interpretation of quantum 
mechanics,” P. N. A. S. 14, 178 (1928). 

6 See Chapter 7 of Pauli Lectures on Physics: Volume 6. Selected Topics in 
Field Quantization. This is the English translation ( 1973 ) of material which 
Pauli presented at a research seminar held at the ETH in Zurich during the 
academic year 1950 - 51 . 
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the function S(x, t: a, 0) introduced at (15) cannot arise from orthodox classical 
mechanics, for it is complex —this we might emphasize by writing 

S(x,f,a,0) = n\[^] 2 -{^} 

—and its real/imaginary parts become entangled by the non-linearity of the 
Hamilton-Jacobi equation (16). 

The factor \J2a\/2n which intrudes at (14) derives from this circumstance: 
the propagator K(x,t;a, 0) is the solution of the Schrodinger equation which 
has by design the property that 

lim K(x, t; a, 0) = S(x — a) 

while (11) refers to the solution which can (in a certain limit, and somewhat 
informally) be said to evolve from y/S(x — a); we have 


Gaussian dx = 1 with Gaussian = 


1 =e 


ry/2'K 


J \!Gaussian dx = \j2a\f2it 


which captures the analytical essence of the situation. Pursuing this remark in 
finer detail: it follows from (11) that 


ip(x,0) = 


7\/ 2n 


exp | — \ | = \/Gaussian (18) 

= \J5{x — a) in the limit a J, 0 


and by computation 


ip(x,t) = / K(x,t;y,0)i/)(y,0)dy 


(19) 


. - (*-«) 

JIL p h 2 t . . 
iht \/ (j-</2n 


1 


dy 


4:71 a 2 it ay/2n 


1 f e ~ /t)]y 2 -2[a-i(T/t)x\y+[a 2 -i(T/t)x 2 }) 


dy 


r 1 / 4ff 2 f 1 la-i(T/t)x] 2 -[l-i(T/t)][a 2 -i(T/t)x 2 } \ 

An a 2 it <7 ~a I [l-*(r/t)j J 


we do in fact (after simplifications) recover precisely (11). 

We observe finally that results obtained above by the process a J, 0 could 
equally well have been achieved by t ] oo; i.e., that (11) entails 


V 2(7'/2tt 


■ip(x,t )—> K(x,t; a, 0) for t r 


( 20 ) 
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For an electron we have 

r = ~~2 if cr set equal to the Compton length: a = ^ 
~ 2.6 x 10 -21 seconds 


so the process (20) is prompt; it becomes (by the factor (y^y) 2 = 5.3 x 10 5 ) 
even more prompt if we set 

2 

a = classical electron radius = • Compton length 

But for a grain of sand (cubic millimeter of SiC>2, which has a mass of 2.65 x 10 -3 
gm.) we have 

r = 5.0 x 10 16 seconds if a = 10 -4 centimeters (one micron) 

~ 1.6 x 10 9 years 

which is certainly not prompt; even for such a precisely located smallish lump 
of macroscopic stuff (4.4 x 10 -5 mole of quarz, with its more than 10 2 ° internal 
degrees of freedom) the process (20) takes roughly the age of the universe to 
approach to completion. 

4. Remarks concerning the “natural time” parameter. The (non-relativistic) 
quantum dynamics of a free particle supplies two dimensioned constants (m and 
H), from which it is not possible to construct a “natural time.” But particular 
solutions of the Schrodinger equation have distinctive shapes , and therefore 
supply characteristic assortments of numbers with the physical dimension of 
length; it was thus that we came to associate a 

“natural time” r = • (minimal variance) = 2m £ 

with the Gaussian solutions (11). We were motivated by (20) to look to the 
numerical value assumed by r in some characteristic cases, but (20) touches 
upon a fairly esoteric point; the simple essence of the matter was implicit already 
in (12), which written 

a 2 (t) = a 2 [l + (t/r) 2 ] 

informs us that 

_ J time required for the variance <r 2 (f) to 
( grow to twice its initial/minimal value 


We have 


a (t) —ut 


2mcr 


For an electron H/2m = cm 2 sec 


if t » r 


u = 5.8 x 10 s cm sec 1 if a = Bohr radius 
= 5 (c/137) = orbita ‘ speed 


( 21 ) 
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To set u = c is to obtain 


a = \ (Compton length) 

which establishes the sense in which the quantum mechanical expansion of a 
primeval Gaussian mimics the expansion of the universe. Note also that by 
the Heisenberg uncertainty principle A p ~ h/2a = mu; this elementary remark 
establishes a sense in which (21) is not at all surprising. 

But from another point of view (21) is surprising: The “fundamental 
solution” 7 of the heat/diffusion equation V 2 t/> = is Gaussian 

= with <?(*) = VWd 


and exposes this fact: 

a ~ f 5 : characteristic signature of a “diffusion processes” 

If, formally, we set D = — then the heat equation becomes the free-particle 
Schrodinger equation, and k(x,t) becomes precisely the propagator K(x,t), as 
defined at (14). Looking back again to (11-13) we see that it is the complexity 
of the construction a\Jl + i{t/r) that accounts for 

1 

a ~ t 2 -> a ~ t 

surprising 


5. The “launched” Gaussian wavepacket. The Gaussian packet (11) sits in one 
place and grows slowly fat. By “launching” such a packet we place ourselves in 
position to model the quantum mechanical motion of a projectile, which departs 
(neighborhood of) the origin at time t = 0 with velocity v and with momenta 
which lie in the neighborhood A p of p = mv. I discuss three distinct but 
equivalent routes to the construction of such a “launched Gaussian wavepacket.” 

It is an implication (set a = 0) of the argument that culminated in (11) 

that 


if(x,t) = 


cr[l+i(£/r)]\/27r 


/_1_x!_\ 

\ 4 a*[l+i(t/T)]] 


exp I -7 


Gaussian standing at x = 0 

= Tj; [ e + * p: V(p> t) dp 


7 Also called the “source solution;” see p. 10 and Chapter 3 of D. V. Wiclder, 
The Heat Equation (1975). 



Launched Gaussian 
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The idea now is to shift the momentum distribution (p —> p — p 0 with po = mv ) 
and then work backwards, writing 

«=m) = jnf * +4w {U i d*^H[ l 5r‘1 a }} . e -wiw dp 

■ ■ f e -ivwo Jf( 




A = ^i 1 + i ( t / 1 

C = 4^0 


_ r i 12 r _7r "| 2 „ 

-ft.A\/27r-l LaJ 


I b 2 -ac 


' 5 PYT , / [Pn+»( m /r)a] 2 -Po[l+i(t/T)] 1 

J LX P\ 4A 2 [l+i(t/r)] / 


lh\A\/2n 

L[1 5 exp { 


4cr 2 [l+i(t/r)] 


| i Po^-(Po/2m)« 1 (OS') 

' ft l+i(i/r) J G - "/ 


new term 


From 


i Po^ 
ft l + (t/r) : 


} (23) 


(24) 


= [etc.] 2 exp [-( x -vt) 2 +i±(x 2 -v 2 T 2 )\ | • exp j 

it becomes clear that the associated probability density 

P ( a; ’ i ) = i^)75F ex P{-M : ^f] 2 } 

does as anticipated describe a Gaussian which drifts to the right with speed v, 
growing fat in the familiar way as it does so. 

At time t = 0 (22) gives 

#*,°)= [^]’ e_i[f]a - e * Po1 ( 25 ) 

which is a specialized instance of (2): set a = 0 and a(x,0) = \p 0 x. If we 
were able to argue on some grounds that introduction of the phase factor en p o x 
serves to “launch ” the Gaussian packet then we should be able to recover (22) 
by dynamical propagation of (25). Returning to (19) (where the same idea was 
developed in the case p 0 = 0), we have 


K{x,t\ y,0)ip(y,0)dy 


i m (x — y)' 
™ rTi ~2 ~ t~ 
iht 


4iv<j 2 it cr\/2'K 


~ \Z erv/27r e ~^^^ ■ e lky dy with k=p 0 /h 

~ 4 y?{[ 1 -*(r/t)]p 2 +2i[-2CT 2 fc+(T/t)x]j/+[-i(T/t)x 2 ]} ^ 

[-2o 2 k+(T/t.)x] 2 + [l-i(T/t)][-i{T/t)x 2 


Act 2 


■ ± , - „„ n /_L_ [-2cr 2 k+(T/t)x] 2 + 1 -i(r/t)][-i(T/t)x 2 ] \ 

47r<T 2 it 6 P \ 4cr 2 1 -i(r/t)} J 


[(t[1 +i{t/ t)\t/2tc \ 


exnl_ £ _ | i kx—(a 2 /r)k 2 t \ 

P 4cr 2 [l+i(t/r)] ' 1 +i(t/r) J 
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which (by a 2 /r = h/2m and k = p 0 /H) does in fact precisely reproduce (22). 
But how might we have foreseen that the adjustment 


7 y/ 27T -I 


' 1 ' 
- <7\/27r- 




(26) 


serves to “launch” the standing Gaussian packet on the left? The question 
motivates me to pose a related question (which might have been posed already 
in connection with (2)): If ip(x,t) is a solution of the free particle Schrodinger 
equation ip xx + inipt = 0 (k = 2 m/h), what condition is imposed upon a(x,t) 
by the requirement that ip ■ e la be also a solution? From 

(d x + ind t )ipe ia = { ( ip xx + imp t ) +{ia xx - a 2 x ~ na t )ip + 2ia x ip x }e ia 

0 


we see that necessarily 

a xx + i' a x + dl g x ^ = 0 

In the present application (see again (22)) 

<9 log 'tp __ x _ __ mx 

dx 2a 2 /r)] 

and Mathematica assures us that a(x, t ) = does in fact satisfy 

a xx + ia l + i? f a t ~ Hr[l™t/T)] a x = 0 ( 27 ) 

This result is confirmatory of (22), but provides disappointingly little insight 
into the mechanism that lies at the base of (26). 

Many years ago I had occasion 8 to describe the structure of the group of 
transformations which send 

solutions —> solutions of the free particle Schrodinger equation 
That work -in which the Gaussian “fundamental solution” 

= vht e ~^ tx2 with 

plays a central role—is by nature an elaborate generalization of the little 
argument just concluded, and does serve to provide insight into (26). But to 
venture down that seldom-traveled road would be ask too much of my reader; I 
turn now, therefore, to a simpler line of argument—a selected detail from that 
more comprehensive work -which captures the essence of the point at issue. 

8 APPELL, GALILEAN & CONFORMAL TRANSFORMATIONS IN CLASSICAL/ 
QUANTUM FREE particle DYNAMICS (1976). I drew inspiration there from 
several sources, but especially from §6 of Widder’s introductory chapter. 
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An inertial observer glides by, contemplating the physics (our §2) of a 
standing Gaussian. The following discussion proceeds from the remark that to 
us, his Gaussian will appear to have been “launched. ” Let us suppose that our 
inertial friend—so that we can continue to use {x, t }—uses { X, T} to coordinate 
the points of spacetime, with 

x = X + vT . X = x — vt 

; inversely 

t=T T=t 


He writes (see again (11)) 


^{X,! 1 ) a [ 1+i{ T/ T )\^ eX p{ la*[l 4(T/r)]} 

to describe his Gaussian-at-the-origin, and notes that 'L(A, T) is a solution of 
the Schrodinger equation, which he writes 


ttvT +iK§p J-T = 0 


{(fx) 

Those pronouncements, in our variables, read 


}' 


4'(a: — vt, t) = 


cr[l-\-i(t/T)]'/2n 


f 1 (x-Vt) 2 1 
\ 4 <7^1 +i(t/r)] / 


expt -j 


and (since ^ and + §~ t ) 

{(kY + iK [ u k + &]}*(*-v*,t) = ° 

The latter equation does not possess free particle Schrodinger form, except in 
the trivial case v = 0. We observe, however, that we can multiply the latter 
equation by any non-zero factor without distroying its validity, and that we 
have at our disposal the operator identity (“shift rule”) 

eV {(kY + iK i v k + §i] } = {(is - v*Y + iK i v (k - v*) + (m ~ vt)] } eV 
= {(k) 2 + ^m} eip 

+ {( - 2 ( Px + iKV ) ft + (v>l - Vxx - iKV(fi x - iKVift ) |e v 


The implication is that ip(x,t) = e v • T(a: — vt, t) will satisfy the free particle 
Schrodinger equation if 

(—2 Lp x + inv) = <p x — ip xx — inv(p x — iKVip t = 0 


These conditions are readily seen to entail 

<p(x,t) = \invx — \inv 2 t + constant 
= | {mvx — \mv 2 t } + ip 0 
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so we obtain 


ip(x,t) = 


L<r[l+i(t/r)]>/27r J 


/ t)\>/2tc J 


exp • 


exp • 


iH { 


1 (x—vt) 2 

4 cr 2 [l +i{t/r)} 

bT + s( 


mvx— hmv 2 t } 


cr 2 [l+i(t/ t)] 


mvx — \mv 2 


*>]} 


which precisely reproduces (22). This line of argument traces (26) to the 
circumstance that Galilian covariance of the Schrodinger equation requires that 
the wavefunction acquires a factor when transformed. We might on dimensional 
grounds argue that the only factors available (at t = 0 ) have the form 


g (numeric) jr mvx 


and that detailed analysis has served only to establish that 

numeric = 1 


6. The associated Wigner distributions. In 1932 E. P. Wigner had occasion 9 to 
pull from his hat (or perhaps from that of Leo Szilard) the definition 

P^>{x,p) = l J ip*{x + f) e 2 * pi ip{x- £)df 


and by 1949 J. E. Moyal 10 had traced the fact that the so-called “Wigner 
distribution function” is so richly endowed with wonderful properties to the 
circumstance that P^(x,p) stands in Weyl correspondence with the density 
matrix 

hP^(x,p) < -—-* It/OW’l 

Weyl 

My present objective is to construct a description of P g aussian(®, p ). 

Looking first to the simple static Gaussian 11 


ip{x) = 



Mathematics supplies 


P^(x,p) 


2 P -H[^] 2 +[f] 2 } 


1 =e 2 '■ 


1 


(T\/27r 

|V>(^)I 2 • \ v (. p )? 


\V2ii 


(28) 


9 “On the quantum correction for thermodynamic equilibrium,” Phys. Rev. 
40 , 749 (1932). 

10 “Quantum mechanics as a statistical theory,” Proc. Camb. Phil. Soc. 45 , 
92 (1949). For historical comments, additional references and a summary of 
the mathematical/physical details see §6 in “Status and some ramifications of 
Ehrenfest’s theorem” ( 1998 ). 

11 This is (2), with a = 0. 
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where (as henceforth) I have allowed myself to make tacit use of ( 6 ): A = fi/2cr. 
These general properties of the Wigner distribution 


JJ P,p(x./p) dxdp = 1 

Pp(x,p)dp = \ip(x)\ 2 and J Pp{x,p) dx = |^(p)| s 


Pp{x,p) is bounded: \Pp{x,p)\ < 2/h 

pertain transparently to the P^,{x,p) encountered at (28), but this 
distinctive quirk” of the Wigner distribution 

P^(x,p) is not precluded from assuming negative values 


(29.1) 

(29.2) 

(29.3) 
“most 


(29.4) 


does not: (28) describes a function which is everywhere positive, and is therefore 
not a “quasi-distribution” but a true distribution function; it is, in plain words, 
a bivariate normal distribution, defined on phase space. 

Working now from the launched Gaussian wavepacket (22)—written 


f>(x, t) ■ exp { [- ^x 2 + lp 0 x\ } 


T = 


a[l-\-i6]y/2ir 



i (Po/ 2 ”b* ~1 
K 1+iQ J 


with 6 = f/r—we have 


Pp(x,p) 


Il^l 2 / expj^h^^ + e) 2 - 

•exp{ I ^ I [- 3 ij(a;- 0 2 + 

2 | y\2 J e -l A e+2BZ+C]/2ha 2 (t) ^ . 


kPo( x + 0 ]}e 2 " p£ 

jiPo{x ~ 0 ]} df 

{ A = h 

B = i[hxO-\-2p 0 cr 2 — 2pa 2 (£)] 
C = fix 2 —4p 0 xa 2 6 


= f |T [ 2 ■ [| 2 ^(t)]s exp { | 

— 1 f _ 2hmv 2 cy 2 te 1 

— 2h 2 <r 2 (t) j 

= b expj-x^ + x P p x + pW - 1 

= (30-1) 

= f»p{-5[^-»^] 2 -5[ t ? !; ] 2 } (30.2) 


It is gratifying to observe that (30) gives back (28) when we simultaneously 
turn off the drift (set v = 0 ), execute a spatial displacement x —> x — a and 
go to the origin of time (0 = 0). And that (as one readily verifies) the implied 
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marginal distribution f P^(x,p) dp reproduces precisely (24). We observe—as 
we observed already at (28)—that the prefactor 

can be written and is time-independent 

and that—surprisingly?—it is not a(t) but a = cr(0) which enters into the design 
both of the prefactor and of the exponent. We note finally that ( 30 ) describes 
a Wigner distribution which is non-negative , everywhere and always. 

7. Minimal dispersion, and the emergence of correlation. From the statement 
Aa’A p>\h : all states \ip) 

of the Heisenberg uncertainty principle it becomes natural to ask: For what 
states 1 1/>) is equality achieved? It is in response to this natural question that one 
is led to the so-called “states of minimal dispersion” which are, as it emerges, 
Gaussian; it is largely (though by no means exclusively) from this fact that 
physicists acquire their special interest in “Gaussian wavepackets.” 

In 1930 Schrodinger observed 12 that if A and B are self-adjoint operators 
associated with any pair of observables, then 

o o /AB —BA \ 2 r/AB+BA\ (n .i 2 . . 

(AA) 2 (AB) 2 > ^ —-) +{(---)-<A)(B)| ( 31 ) 

/AB - BA \ 2 

with equality if and only if it is simultaneously the case that 

• the “quantum correlation coefficient” ^ AB+BA \, _ )/\)(B) vanishes, and 

• the vectors {A — (A)}|-f/>) and {B — (B)}|^) are parallel 

These conditions, in the special case A —> x and B —> p, are readily shown 13 to 
entail—compare (25)— 

ip(x) = (x\ip) = 1 with a = Ax 


Turning now (from “quantum pre-dynamics”) to quantum dynamics , one 

sets 


ip(x, 0) = minimal dispersion wavepacket 


12 The detailed argument is reviewed in §2 of “Status and some ramifications 
of Ehrenfest’s theorem” (1998). See also the discussion in § 7.1 of Max Jammer’s 
The Conceptual Development of Quantum Mechanics (1966). 

13 See David Bohm, Quantum Mechanics (1951) § 10 . 10 ; David Griffiths, 
Introduction to Quantum Mechanics (1995) § 3 . 4.2 or virtually any other good 
quantum text. 
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and looks to the dispersive properties of 

= j K(x,t]y,Q)i>(y,Q)dy (32) 

It is a familiar fact 14 that the ground state of an oscillator is Gaussian 

V , osciiiator(a:, 0 ) = * e“ : evidently a = Ax = sjH/2mw 

and, because we are talking here about an eigenstate, we know even without 
appeal to the propagator that 

^oscillator (x,t) = [^e~^ • 

We have here exhibited a particular state of a particular system with the 
property that minimal dispersiveness is persistent. If, more generally (and 
more interestingly), we take 

ip(x, 0 ) = : displaced copy of the groundstate 

and work from (32), we at length 15 obtain 

ip(x,t) = [^]ie"^ (x " Acosa lt)2 • e“ i/(x ’ 4) 
where f(x,t) = ^ + ^ [2xA sin cot — ^A 2 sin2ujt] is periodic; then 

\lp(x,t)\ 2 = [m^]l e -^rG-Acos W t) 2 

= Gaussian, sloshing rigidly back and forth, with amplitude A 

The results just summarized are of interest not least because they are so 
atypical: generally, minimal dispersiveness is not persistent. Indeed, we saw 
already in §4 that 

AxAp = \h -> 1 + (£/ r ) 2 ( 33 ) 

free particle 

T EE ^ With U = ^ 
u m 

I propose now to discuss, from a rather novel point of view, how (33) comes 
about . 16 

14 See Bohm, §13.8. 

15 See quantum mechanics ( 1967 ), Chapter 2, pp. 89-95 for the details. 
One needs to know that the oscillator propagator can be described 

^oscillator (*, t, y , 0) = ^ 2 eX P { “ MfSWi C0S ^ ~ 2x v\ } 

16 Closely related remarks can be found in Bohm’s §10.8, but he was obliged to 
proceed without the assistance of a tool which (as I hope to demonstrate) lends 
itself particularly well to such discussion—the notion of a Wigner distribution. 
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Equation (30) has the form 

P ('v rr /\ — ^polynomial of degree 2 in variables x and p 

ipPi 


_ 2 quadratic in “displaced variables” X=x—vt and P=p—mv 


and can be notated 


with 


Pi>(x,p;t) = ^rVdetA e 





) and A = 

( * 


/ 1 


1 +e 2 ) 






-f— V clet A 


2ir cr\ 


2 

h 


(34) 


(35) 


I wrote (34) to establish contact with the standard theory of bivariate normal 
distributions , and digress now to review the bare essentials of that subject . 17 
We have these Gaussian integral formulae: 



( 1 \ 


( 1 \ 


X 


0 

h^ac - b 2 jj 

y 

X 2 

e _l {aX 2 + 2bX y + C y 2 )dxdy = 

0 

+c/(ac — b 2 ) 


xy 


—b/(ac — b 2 ) 


\y 2 / 


\+a/(ac — b 2 ) ) 


subject only to the conditions a > 0, c > 0 and ac — b 2 >0. Evidently 
F(x,y) = ^^ae-b 2 e-^ ax2+2bxv+cy2) 

= iv®JAexp{-i(j) (“ ()(*)} 


can be understood to describe a bivariate distribution, with vanishing means 



F(x, y) dxdy 


and (trivially centered) second moments—variances and “covariances”—given 

by 


XX 

-2 




yy^j F(x,y)dxdy 




A " 1 


17 See Harald Cramer, Mathematical Methods of Statistics ( 1946 ), §21.12. 
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In terms of the latter, we have 




2 2 
a yy ~ a xy 


2 2 

®yx ®xx , 


( a y a y 

-pVxVy ] 


f G x CT x pG x (Jy \ 

V -P°x a v 

Gx&x J 


\ PGxGy GyCTy J 


1 -p 2 


J x u y 

1 


y 


(36) 


where I have abandoned the phoney distinction between a xy and a yx , adopted 
the notational simplifications cr xx —> a x and a yy —> <j y , and introduced 18 the 
(dimensionless) “correlation coefficient” 


P = 


xy 

G x &y 


The distribution F(x,y) is constant on curves 


a b 
b c 


= a: A x = constant 


(37) 


which (by detA = 0 . 2 0 . 2 ( 1 1 _ p 2 ) > 0 , which entails — 1 < p < + 1 ) are in fact 
concentric ellipses, centered at the origin, and of a shape/orientation which is 
set by the spectral properties of A. One has 


eigenvalues 


O l + °l) ± i J O* - °l) 2 + 4p 2 (j2cr2 


- P 2 ) 


\ (sum) ± | (difference) 


(sum) = 




-P 2 ) 


= ~r~^ I "V + F = trA 

1 — p z L cr± at J 


(38) 


To discover the orientation of the ellipse we draw upon the fact 19 that every 
2 x 2 symmetric matrix can be written 


( cos i) —sin iJ 
Y sin cos $ 


Ai 0 \ 

f cos'd 

— sin i9 A 

0 a 2 j 

l sin d 

cos i9 J 


( \ (sum) + \ (difference) cos 2 $ 
— \ (difference) sin 2 $ 


\ (sum) 


| (difference) sin 2 $ 
\ (difference) cos 2 $ 


18 See Cramer, p. 265. 

19 See §1 of “Non-standard applications of Mohr’s construction” ( 1998 ). 
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Working from (36) and (38), we therefore have 


sin 2d = 


cos 2d = 


^PCf x@ y 


yf O* - ^y) 2 + 4 p 2 <j2(t2 


2 2 
- Vy 


+ l J 


(39) 


Looking to what (38) and (39) have to say in some special cases: if p = 0 then it 
is obvious from (36) but also an implication of (38) that the eigenvalues of A are 
l/a 2 and 1 /cr 2 , while (39) gives d = 0 (no rotation is required to diagonalize 
the already-diagonal matrix: the principal axes coincide with the coordinate 
axes): no cross-term appears in the exponent, so F(x,y) factors 

F(x,y) = f(x) ■g(y ) 


( f(x ) and g(y) are, as it happens, both Gaussian) and the random variables x 
and y have become “independent.” At p = ±1 the right side of (38) becomes 
singular, but useful information can be obtain by setting p = 1 — e (else 
p = — 1 + e) and studying the approach to singularity (limit e | 0); we have 


eigenvalues 


e 2<x 2 a 2 
1 

o'l+a 2 



(40) 


and so obtain concentric elongated ellipses of vanishing area—in short: a line, 
of 


slope = tan d = ± yj 


1—cos 2-»9 
1+cos 


_ _j_ ^y. 


(41) 


(I will not linger to resolve the sign ambituity); the random variables x and y 
have become perfectly (anti)correlated, which is to say “dependent.” 20 


Returning with these classic resources to our physical problem: comparison 
of (35) with (36) gives 




CT p 
<7 X <J F 


1 

1 -p 2 
1 1 
1+6 2 1 ~p 2 

P 1 

e 


A 2 

<j\ 


20 We are in position now to use 1 st and 2 nd moment data, whether obtained 
from experimental observation or from some theoretical J(x,y), to construct a 
“bivariate normal distribution of best fit.” I have no present reason to pursue 
the details (a variant of the program will be taken up in a companion essay), 
but see Cramer’s Chapter 21 or the resources in Mathematical “Multinormal 
Distribution” package to gain a sense of what lies down that much-traveled 
road. 
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The first two of those equations entail a x a P = 1 J l ^ 2 a\, which upon 

comparison with the third gives 


P VT+0 2 


i.e., 


1 

1 -p 2 


=i+e 2 


(42) 


whence 

<J x = o\J\ + 6 2 
<jp = X 


(43) 


which precisely reproduce (12) and its momental companion. But while (12) 
refers to a property of the Gaussian wavepacket ij)(x,t) described by (11), the 
equations (43) refer to a more complex (but physically equivalent) object: the 
P^(x,p;t) of (30). 

We are, by (42), placed in position to state (in reference to the quantum 
dynamics of a free Gaussian wavepacket) that the correlation coefficient 


r,u\ - t/r _ / ( t/T ) H- —► o at t = 0 

P'- ' v / i+(t/r) 2 { 1 — |(r/t) 2 + • • • — > 1 for t ^ t; i.e., as t | oo 

That x -measurements and /^-measurements yield results which are initially 
Mncorrelated is plausible enough, but that there exists any sense in which 


asymptotically in time, p-measurements become 
redundant with x-measurements! 

is counterintuitive... though susceptible to interpretation. Asymptotically in 
time, we have P = slope-A; i.e., 

p — mv = slope • (x — (x)) 

while it follows from (41) by (43) that 


slope 


A 

as/l+fP 


The distribution—which at finite times was constant on concentric ellipses with 
moving centers—has in the limit ft 00 become axially symmetric about the 
“line of zero slope” p = mv; we have become unable to say anything useful 
concerning the likely outcome of an a;-measurement, but retain our initial 
ability to speak usefully about p -measurements. Looking to (30.1) for direct 
support of this conclusion, we are reminded that, to the extent that P^,(x,p;t) 
has acquired axial symmetry, it has lost its normalizability ; the thing to notice 
is that the marginal distribution 


j {right side of (30.1)} dx = exp {-§ [JL_py] 2 } 


is ^-independent: it supports our conclusion at all finite times, and therefore 
does so also in the limit. 
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Looking again to (30), we notice that 


= vt 

= mv : constant 
maximal value is at all times given by | 

and that the maximal value is in fact greatest-possible for a (bivariate) Wigner 
distribution. The valuation drops to fe - 2 (60% of maximum) on the moving 
curve 


Pp(x,p; t) is maximal at the moving point 


/ x(t) 

X p(tj 


r x—vt 
L <7 

which at t = 0 reads 


p—mv 1 2 \ p—mv' |2 

A J 1 L A J — 1 

(44) 

=1 

(45) 


The latter equation describes a ellipse (center displaced a distance mv up the 
p-axis), with principal axes parallel to the coordinate axes, and 

area = 7 rerA = j (46) 


Generally, the area of the ellipse x T A x = 1 (det A > 0) can be described 


area = 7 r\/product of eigenvalues of A 1 = nV det A -1 = 7 r/Vdet j 


In connection with the general theory of bivariate Gaussian distributions we 
have already had occasion to remark that det A ' 1 = er 2 cr 2 (l — p 2 ), and from this 
it now follows (see again (35)) that the area of the ellipse (44) is given (not just 

initially but) at all times by (44): the deformation (45) -> (44) is, in fact, 

area-preserving. 

The “incompressibility of dynamical phase flow ” 21 is in classical mechanics 
a celebrated implication of the fact that dynamical phase flow is canonical, and 
has important applications especially to (classical) statistical mechanics. It is 
a notion not common to quantum mechanics, but has here been encountered 
in connection with a particular application of the “phase space formulation” of 

quantum mechanics. I now show that (45) -> (44) can, in fact, be extracted 

from the classical dynamics of a free particle: 

Generally, if 

x 0 1 _, f x = f(x 0 ,p Q ; t) 

Poj 4 \p = d(x 0 ,p 0 ;t) 

describes an invertible (not necessarily dynamical) t-parameterized map, and if 

<P{xo,Po) = 0 


21 Liouville’s theorem: see §9-8 in H. Goldstein, Classical Mechanics ( 1980 ). 
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serves implicitly to describe a curve Co inscribed on phase space, then we write 
ip(x,p) = 0 ——■> <p(x,p;t) = <p(f-\x,p;t),g-\x,p;t)) (47) 


to described the flow-induced deformation Co —► C* of the curve. The dynamical 
flow generated by the free particle Hamiltonian can be described 


Po 


x = x 0 + -p Q t 
P =Po 


“free particle sheer” 


and as an instance of (47) induces 


[a; j 2 _|_ |~ p —mv j 2 _ i j- a—(p/m)t j 2 _j_ |~ p —mv j ' 


= 1 


(48) 


But 


^_g.r = rnv] = l[ x _ vt _*_ (p _ mv)t ] 

a _ J_ 

r A m 

= [-—after simplifications 


so at (48) we have in fact recovered (45) 


(44): see the following figure. 



Figure 1: Mechanism responsible for the dynamical development 
of correlation. The figure derives from (48), in which I have set 
a, X, to and v all equal to unity, and t = {0,1, 2,3}. A similar 
graphic appears on p. 204 of Bohm’s text, but is claimed by him to 
refer only to the classical physics of a free particle, and because he 
works without knowledge of the phase space formalism he is obliged 
to be vaguely circumspect in drawing his quantum conclusions. We, 
however, are in position to identify the sense in which (48) pertains 
as directly and literally to the quantum physics of a free particle as 
it does to the classical physics. Also implicit in the figure are the 
statements 

Vx{t) = OsJ\ + (f/r ) 2 

<Jp(t) = constant 

which we associate familiarly with the quantum motion of Gaussian 
wavepackets, but are seen now to pertain equally well to the classical 
motion of Gaussian populations of free particles. 
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8. Quantum analog of Liouville’s theorem. It is a frequently reenforced lesson of 
computational experience (unsupportable, so far as I am aware, by any direct 
appeal to physical intuition) that the dialog between classical and quantum 
mechanics becomes uniquely felicitous when the Hamiltonian depends at most 
quadratically upon its arguments, and it is for this reason that most experienced 
physicists would, I anticipate, hold that the quantum/classical confluence 
developed in the preceding section and summarized in the figure is “not very 
surprising.” My objective here will be to show how the phase space formalism 
can be used to make such a train of argument clear and precise. 

In ordinary quantum mechanics the motion (in the Schodinger picture) of 
the density matrix p = can be described 

ih§- t p= [H,p] 


The preceding equation can be shown without much difficulty 22 to stand in 
Weyl correspondence with the following fairly awesome equation: 

§- t P4x,p-,t)= sSin{f(| E -^--| i -^-)}fr(® ) p)P^(*,j>;t) (49) 

in connection with which it is to be understood that H(x,p) < -> H; that 

Weyl 

sin {f (etc.)} ^ (| (etc.)} - £{§(etc .)} 3 + ^{f (etc.)} 5 - • • • 

= differential operator of infinite order 

and that — sees the ^-dependence of H but is blind to that of P, etc. The 
point to which I would draw attention is that (49) simplifies greatly 


I 

= ~ if H is quadratic in its arguments 

_ dH_dP_ _ dP dH 
dx dp dx dp 

= [H, P^} : Poisson bracket 

Equation (49) is, within the phase space formalism, the analog of—and conveys 
precisely the same information as—the Schrodinger equation. It can be written 


|jPi/,(a;,p; t ) = [H, P^\ + power series in h 


(which suggests the special utility of the formalism to semi-classical lines of 
argument) and in quadratic cases assumes precisely the structure of the Liouville 
equation 


§t D (x,p-,t ) = [H,D] 


(50) 


22 See Chapter 3, p. 110 of quantum mechanics ( 1967 ). 
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—the upshot of Liouville’s theorem, according to which the local density of any 
population of points sprinkled on phase space is, with respect to any flowing 
point, constant: 


it D (x(t),p(t);t) = ™+P?§+x^ 

_ dD dH dD | dH dP 

dt dx dp ' dp dx 

= %-[H,D] 

= 0 ( 51 ) 

Liouville’s theorem is, as previously remarked, an expression ultimately of the 
“incompressibility of classsical phase flow.” The quantum dynamical equation 
(49) can in this light be construed to be a modified analog of Liouville’s equation, 
an assertion that “phase flow in the quantum world is—except for quadratic 
Hamiltonians— not incompressible, but squishy.” 

Returning now from generalities to the Gaussian’s of immediate interest, 
we observe that the Wigner distribution encountered at (30)—which we are in 
position now to write 

n*,r,t) = - sm') (52) 

—is, almost trivially, a solution of the Liouville equation (50). And that, since 

fT — 1 

n free 2m" 

is quadratic, means that it is (to say the same thing another way) a solution 
also of the “Schrodinger equation” (49). 

9. A paradox, and its resolution. In accounting thus for the “quantum/classical 
confluence” we have achieved almost too much of a good thing. For (52) contains 
no reference to H , and describes a correctly normalized Gaussian solution of 
(49/50) for all values of a and A— even those that stand in violation of the 
Heisenberg/minimality conditions a A > ^h. But the uncertainty principle 
admits in quantum mechanics of no exceptions! 

My first approach to the resolution of this paradoxical development hinges 
on the seldom-remarked fact that only conditionally can a solution P(x,p-t) 
of (49) be interpreted to be the Wigner distribution associated with some wave 
function if(x, t). 

When one is—as we were, at beginning of §6—introduced to the “forward 
Wigner construction” 

4>{x) ——-> Pi,{x,p) = \ [ ip*(x + £) e 2 ^« ip(x - 0 df (53) 

Wigner J 

and told that it lies at the foundation of an elaborately developed “phase space 
formulation of quantum mechanics” which is entirely equivalent to the standard 
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formulation (but in some respects more attractive), it becomes natural to ask: 
“How does one pass backwards across the bridge that interconnects the two 
formalisms? How does one describe ip(x) < - P^(x,p)?” Curiously, the 

Wigner 

literature known to me is silent on this point. The desultory attention which I 
had given to the problem over a span of forty years had served only to convince 
me that, since the solution entailed a kind of de-convolution, “extraction of a 
functional square root,” it was probably hard. I was therefore amazed when 
my colleague Mark Beck (who has an experimental interest in the phase space 
formalism) referred casually, in private conversation, to this elegant solution : 23 

By Fourier transformation of (53) we have 

J P. 4 ,{x,p)e~ 2 * p Z dp= J ip*{x + £) 5(£ - £)ip{x - £) 

= ip*(x+i)ip(x-i) 

Select a point a at which f P 4 ,(a,p) dp = V’*(«) V’( a ) 7^ 0. 24 Set £ = a — x to 
obtain 

J P^(x,p)e~ 2 ^ p ^ a ~ x ^ dp = 4>*{a) ij)(2x — a) 
which by notational adjustment 2x — a 1 —> x gives 


v-(x) = [r(a)r ■ J p*w>p) eiP(x ~ a) d p 

i 

= [V’*(0)]“ 1 • J P*(f,p) e^ px dp in the special case a = 0 

Evidently 1^(0)| 2 = / P^(0,p) dp , so we have Beck’s formula—the “backward 
Wigner construction” 

e ia f i 

ip{x) = , = ' / P^,P) e nPX dp <——- P^(x,p) (54) 

JfP^dp J 

The wavefunction if) is delivered to us already normalized, fixed to within 
specification of an unphysical phase factor. 

Insertion of the P(x,p; 0) of (52) 25 into Beck’s formula (I discard the phase 


23 The following account of “Beck’s trick” is taken from §6 of “Status and 
some ramifications of Ehrenfest’s theorem” ( 1998 ). 

24 Such a point is, by / ip*{x ) ip{ x ) dx = 1, certain to exist. It is often most 
convenient (but not always possible) to—with Beck—set a = 0. 

25 It is simply to avoid irrelevant notational complexity that I have set t = 0. 
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factor) gives 

ip(x,0) 



This function is supposed to be “already normalized,” but is normalized- 
and in fact reproduces precisely (25)—if and only if [^2 + %V] = -4, and 
this is equivalent to the stipulation that the minimality condition a\ = h/2 be 
satisfied. 

We are brought thus to this resolution of our paradox: 

Each of the {a, A}-parameterized functions (52) is a solution of the Liouville 
equation (50). And each, therefore, is a solution of the “Schrodinger equation” 
(49). But it is possible to set up an association of the form 

P(x,p\ t) <-» ip(x, t ) if and only if a A = \h (55) 

Wigner 

Quantum mechanics enforces the minimality condition, which was seen already 
at (46) to be interpretable as a statement that certain ellipses (those of Figure 1) 
have area = \h. This development is superficially reminiscent of the principle 
used by Planck to “quantize” the classical mechanics of an oscillator, and of 
the subsequent “Bohr-Sommerfelcl quantization condition,” and might reward 
closer study. 

I discuss now an alternative approach to resolution of our paradox. We 
recall that (to within a dimensionally enforced factor of h, and as was first 
appreciated by Moyal 10 ) 


Wigner distribution P(x,p) <- > density matrix p (56) 

Weyl 

and that in the general (or “mixed”) case the density matrix 


p = ^ statistically weighted projection operators 

= J2 p k\^k)H>k\ 

i 

= single term of unit weight 
= IV’) (V’I in the “pure case” 


An algebraic characterization of the mixed/pure distinction is provided by the 
statement 

tr o 2 I = 1 ' n u P ure ” case 
l < 1 in the “mixed” case 


(57) 
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(Notice in this connection that 0 < tr is automatic.) Weyl transform theory 
supplies the information that if 


A <- 


Weyl 


-> A(x,p) and B 


Weyl 


B(x,p) 


then 26 


trAB= ^ Jj A(x,p)B(x,p) dxdp 


It follows that if—generalizing remarks presented at the beginning of §6—we 
write 

p < -> hP(x, p) 

Weyl 

then (57) can, within the phase space formalism, be expressed 


hffp( X ,p)P( X ,p)dxdp { 


= 1 in the “pure” case 
< 1 in the “mixed” case 


(58) 


If, in particular, we take P(x,p) to be given (at time t) by the right side of (52) 
then the ff is trivial, and we (at all times) obtain 

^ i f = 1 in the “pure” case 
4-k<t\ \ in the “mixed” case 


The gratifying implication is that 

{ < is quantum mechanically precluded 

= in the “pure” case (59) 

> ifi in the “mixed” case 

Only in the pure case is it sensible to search for an “associated wavefunction 
ip(x,ty’ (Beck has taught us how to conduct that search), and only in that 
case are we authorized to adopt the more emphatic notation P^(x,p', t). In the 
mixed case we expect to be able to write something like 


right side of (52) = ^ p k Pj, k (x,p; t) 


Weyl 

= spectral resolution of p 


The problem thus posed will, as it relates to “fat Gaussians,” be solved in §12. 
We touch here upon a particular instance of an important general problem to 
which I hope to return in the near future. Our “paradox” has, in any event, 
been laid neatly—and informatively—to rest. 


26 See quantum mechanics (1967), Chapter 2, p. 109. 
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10. Non-Gaussian wavepackets. The “wavepacket” concept is very general, and 
admits in principle of infinitely many realizations. So also is the notion of a 
“bivariate (quasi)-distribution.” The “phase space formulation” of quantum 
mechanics serves to fuse those two primitive notions. 

Talk about “wavepackets” tends to degenerate quickly into illustrative 
talk about “Gaussian wavepackets,” not because the packets encountered in 
laboratory situations are known to be Gaussian, but simply because the function 
e~ x is (in all relevant respects) analytically so tractable. Similarly, talk about 
“bivariate distributions” tends to focus upon properties of the “bivariate normal 
distribution,” but in the latter connection “analytical tractability” is a lucky 
accident; the deeper reason is that, in many applications, normal distributions 
acquire (from the central limit theorem) special importance. The phase space 
formalism permits that “special importance” to be shipped back into quantum 
mechanical discourse (or does it? That is the question which, as indicated in 
my introductory remarks, serves to motivate my present work.) 

My objectives here will be to emphasize—working all the while within the 
bounds of “analytical tractability”—that many of the methods/results which 
we have used/obtained in reference to Gaussians are in fact not specific to 
Gaussians, and to show that some of the information brought thus to light is 
of physical/mathematical interest in its own right. 

Let P(x,p) be any real-valued function of the indicated arguments, subject 
only to the normalization condition 


fJ P(x,p) dxdp = 1 


(60) 


The side-condition P(x,p ) > 0 (all x andp) would, if imposed, cause P(x,p) to 
become a “bivariate distribution function,” but will not be imposed. We might, 
for example, take P(x,p) to be given by any of the following expressions: 


P 0 {x,p) = +^ x e~^ z 

P ^P) = -2^Px e ~ hz { 1 
P 2 {x,p) = +^e-^ z { 1 

P n fop) = (-)"^e-**{L n (z)} J 

where z = + [E]* 

and L n (z) = ^e z (^z) n e~ z z n is the Laguerre polynomial of order n. We are 
assured by Mathematica that each of those functions does in fact (all positive 
<j and A) satisfy (60), and are informed additionally that in each case 


*} 

22- 


1,21 

2 2 J 


(61) 


h 


P(x,p)P(x,p) dxdp =1 if and only if <7 A = 
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—none of which remains surprising when I confess that I borrowed (61) from an 
antique discussion 27 of the phase space formulation of the quantum theory of 
an oscillator. When the minimality condition is satisfied, Po(x,p) reproduces 
precisely (28), which when (consistently with minimality) we set 

a = \/7T*- — and A = \/m 

y 2m u) Y 2m 

reads and entails 

Po(x,p) = \e-^ z with z = ^{^x) 2 + (p/m) 2 } 


backward Wigner, according to Beck 


ipo(x) = 


1 — Af-V 

—t - p 4 L cr j 

V 2 ¥ 


muj ' 
h-K . 


l 

4 


X 


oscillator groundstate (see again §7) 


Similarly 

P n (x,p) = {-) n le~^ z L n (z) 


Wigner 

ipn(%) = Y ' -Hn(f) : oscillator eigenstate 

so the functions (61) are actually not at all esoteric, and would be “well-known” 
if only the phase space formalism were! It is interesting to notice that oscillator 
eigenfunctions become localized “by action of the spring.” If we used the spring 
to assemble a copy of the ground state, but at time t = 0 snipped the spring , 
we would be returned to precisely the familiar physics of §1. But... 

If we used spring to assemble an excited oscillator state and then got 
similarly busy with our snips, we would at times t > 0 find ourselves watching 
the evolution of a non-Gaussian wavepacket. And (to return to my main point) 
we would find, moreover, that our former methods still served: we would, on 
the basis of the classical flow pattern, construct 28 

P n {x,p-1) = P n (x - ( p/m)t,p - mv ) 

27 QUANTUM MECHANICS ( 1967 ), Chapter 3, pp. 116-120. My original sources 
were U. Uhlhorn, Arkiv fur Fysik 11 , 87 (1956) §5; M. S. Bartlett & J. E. Moyal, 
Proc. Camb. Phil. Soc. 45 , 545 (1949) and the appendix to G. A. Baker, Jr., 
Phys. Rev. 109 , 2198 (1958). 

28 Notice that we could have adopted this modified definition 



without changing the force of preceding discussion. I have set a = 0 and b — mv 
in order to “lauch” our “dissociated oscillator packets.” 
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We would observe that (since H = ^p 2 for t > 0) the argument that gave 
(51) still pertains, and that (since the Hamiltonian is quadratic) it is therefore 
automatic that P n (x,p\t) satisfies the quantum dynamical equation (49). 

The statements (59) pertain not just to Po(x,p) but to each of the quasi¬ 
distributions P n (x,p), each of which is—subject only to the proviso crA > \h 
—quantum mechanically unexceptionable. It becomes interesting in this light 
to notice that 

P n (x,p) is classically precluded except in the case n = 0 
The elementary reason for this state of affairs is made evident by Figure 2: 



Figure 2: Graphs of 

. X 1 -V 

F 0 {z) = +e a* 

Fi(z) = —e~i z {l — z} 

F 2 (z) =+e~i z {l-2z+^z 2 } 


VS. \fz 


F n {z) crosses the axis n times, with the consequence that each of the 
excited state functions P n (x,p) {n = 1,2,...} becomes negative on 
concentric elliptical rings drawn on classical phase space. This is 
the reason for the “quasi-distribution’ terminology, and the reason 
that such distributions do not admit of classical interpretation. 


Venturing now a bit farther afield, we notice that the function 

/(*) = 7k Sedl (a) ( 62 ) 

are localized/normalized, and might plausibly be taken to describe the initial 
design of a wavepacket; |/(a;)| 2 looks, in fact, very “Gaussian” when plotted 
(see Figure 3), and f(x) shares with Gaussians the uncommon property of 
being self-inversive under Fourier transformation: 
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Figure 3: Plot of \f(x)\ 2 = 5 ^sech 2 (^) in the case a = l. 





Fourier 


9{p) = Vh J e iPX 9( x ) dx 


_ ' 1 ' 
L \V2tv - 


-1(E) 

O 4 y A / 


A = h/2cr 


| compare 

sech(f) Four , er > f(p) = ^j e~^f(x) dx 

= ^ sech (|) : !3=^-h/2a 


Because |/(ai)| 2 and \f(p)\ 2 are even functions of their respective arguments, it 
is immediate that 

(a: odd ) = (p odd ) = 0 

but when we look to the even moments it becomes clear that the sech-packet 
places us near the outer limits of “analytical tactability.” After a certain amount 
of experimentation (many complaints from Mathematica ) I have, however, come 
up with this trick: 


h sech 2 (f) • cosh(sa;) dx = (x°) + £(x 2 ) + C(x A ) + • 


_ TTSOt 


^-{tanpf] 


-cot[^f]} 


— 1 -I- 


4! 240 


2! 12 
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from which we conclude that Ax = ^nPJvia and A p = y / 7 r 2 / 12 / 3 : 

Ax Ap = j 2 ct/3 = • \h = 2.0944 • (63) 

We had no reason to expect the seclr 2 distribution to be minimally dispersive, 


and it is not. Evidently the normal distribution with the same variance is 

l 2 -i 


y.yj 7T 3 /6 


exp {— | 


L OLyJ 7T 2 /I 


(64) 


but if we sought the normal distribution which “best approximates” the sech 2 
distribution it would be natural to seek to 

1 2 


minimize 


i e -|(f ) 2 _ J_ sec h 2 (i) 

t\I2tt ' 2a \a) 


dx 


The integral, however, appears to be intractable. This train of thought did, 
however, lead me to set 


/ 




equal to 


2 ii Sech 2 (g) 


1 2 


dx 


3a 


and thus (on admittedly the most tenuous of grounds) directed my attention 
to a normal distribution 


3a/\/2 


exp • 


1 2 > 


3a:/2v / 7r 


(65) 


does, on the evidence of Figure 4, appear to provide a better fit. 


Looking to the associated Wigner function, we (after some elementary 
manipulation 29 ) have 

POO 

p f(x,p) = 1 ^ 2 / sech(^) sech(^) cos (fp£) d£ 

Jo " - V -' 

= M cosh ( I a : )+ cosh ( l^)]" 1 

The integral appears to lie beyond Mathematical capability, but can be 
discovered in Gradshteyn & R.yzhik , 30 whose reported evaluation (after a 
correction) entails 




sinh(^) sin h (^) 


( 66 ) 


29 The intent of the manipulation is to make manifest the reality of the Wigner 
function. For the identity, see Abramowitz & Stegun, 4.5.43. 

30 Table of Integrals, Series, and Products ( 1965 ), where at 3.983.1 one reads 



cos ax dx 
b ch j3x + c 


(missing factor of 4?) 


7rsin ()) arch |) 

fty/c 2 — b 2 shf 


c> b > 0 
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Figure 4: Comparisons of (64) — above—and (65) with \f(x)\ 2 , in 
the case a = 1. The latter appears to the eye to provide the better 
fit. 


The accuracy of (66) is supported by the observations that (according to 
Mathematics) J J Pf(x,p) dxdp = 1 and, as (29.3) requires, 

\P f (x,p)\<P f (0,0)=l 


More convincingly, insertion of (66) into Beck’s formula (54) is readily found to 
give back precisely the f(x) of (62). When written out as functions, Pf(x,p) 
and its “best fitting” Gaussian counterpart (I work from (52), with t = v = 0 
and A = h/2a) 


Pg( x iP) = fexp{- i[f] 2 


1 

2 



a=2>a./2-\/H 


(67) 


resemble one another not at all, but when plotted (see Figure 5) their difference 
is almost imperceptible. They do, however, differ in one important respect: 
P g (x,p) is everywhere positive, but Pf(x,p) vanishes/changes sign on the 
hyperbolic contours defined by 


xp = n\ 


n = ±1, ±2, ±3,... 
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Figure 5: The function Pf(x,p) is plotted above, and its “best 
Gaussian approximant v P g (x,p) below. In preparing the figure I 
have set a = h = 1. 


More specifically, we have 


Pf{x,p) > 0 
Pf(x,p) < 0 
Pf(x,p) > 0 


_1 < 4xp < + i 
_ 2 < 4xp < _1 or 
_ 3 < 4xp < _ 2 or 


+ 1 < 4xp < +2 

+ 2 <i^<+3 > 


( 68 ) 


The Wigner function Pf(x,p) is a “quasi-distribution” with emphasis on the 
quasi, and is classically disallowed, but in the latter connection we notice that 
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the central positive region expands to enclose all of phase space in the classical 
limit h 10. 

We know on the basis of the general Galilean transform argument advanced 
in §5 that to “launch” the wavepacket f(x)— because to launch any wavepacket 
—we have only to make the adjustment 

/Or) —- f(x) ■ ( 69 . 1 ) 

launch 

which in the phase space formulation of quantum mechanics (see again (53), 
the equation which, at the beginning of § 6 , served to define Pf(x,p)) becomes 

Pf(x,p) -> Pf(x,p — mv) (69.2) 

launch 


The preceding remark is consistent with the pattern of events foreshadowed 
already in classical mechanics , 31 where the Hamiltonian theory of Galilean 
transformations is found to be simpler than the corresponding Lagrangian 
theory; the later requires that a gauge transformation be built into the definition 
of a Galilean transformation. That adjustment entails adjustment 

S —> S + { — mvx + \mv 2 t } 

of the classical action, and it is to this development that both (69.1) and (69.2) 
can ultimately be traced. 

To obtain a description of the (free) dynamical evolution of if(x, 0) = f(x) 
we might attempt to proceed from (19), writing 


4>(x,0) 


= / \/m e *** * 4 =sech(^)dy 


free dynamical 


y/2a ‘ 


but the integral appears to be intractable. Alternatively, we might pass to the 
phase space formalism, write 


Pf(x,p; 0) 


free dynamical 


Pf{x,p\ t ) = Pf(x- ^ pt,p ; 0) 


sinh(^)sinh(^p) 


(70) 


X — >x— -jj \rpt 


and attempt to recover ip(x, t) by appeal to Beck’s formula (54). But again, the 
integrals encountered in the final step appear (except at t = 0 ) to be intractable; 
we have encountered a problem in which the phase space formalism is easier to 
carry to completion than the standard formalism. 


11. A conjecture—stated and refuted. For a while—intuitively, and on the 

slender evidence afforded by the harmonic oscillator—it seemed to me plausible 
to conjecture that 

Pground state (x,p) is, for all systems, everywhere non-negative 


31 See classical mechanics ( 1983 ), p. 252-254. 
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But we are in position now to show by counterexample that such a conjecture 
is untenable. 

While integrals involving sechx, sech 2 x, etc. tend, as we have seen, to 
be less tractable than those involving exp {— a; 2 }, the derivative properties of 
the former functions are in some respects more attractive than those of the 
Gaussian; one has 


(lx) s i n h x = sinh x — 2 sinh 3 x 

( 7 2 2 2 4 

4-) sinh x = 4 sinh x — 6 sinh x 


which on their right sides exhibit what we might call the property of “non-linear 
functional closure” (and an absence of radicals). The latter identity lies at the 
heart of the construction of a famous solitonic solution of the Korteweg-deVries 
equation ut + uu x + ku xxx = 0, 32 but it is the former which is of present interest; 
it entails 

(s) 2 sinh(f) = ^{l- 2 sinh 2 (g)}sinh(g) 


or again 


(- £-(£-) 2 - -^sinh 2 (^))sinh(^) 

1 2m \ ax / mar V ot / J \ a / 


2ma 2 


sin. 


MS) 


We conclude that (62) describes a localized eigenstate /( x) of the system 

H(x,p) = 5 bP 2+U (x) 

U(x) = -C/ mi n sinh 2 (g) 

in which the “attractive sech 2 -potential” has been “tuned” by setting 


G min a 2 = h 2 /r 


The associated eigenvalue is 

E = ~\U m in 

Because f(x) is node-free, we conclude that fix) is in fact the ground state of 
the system in question. But the associated Wigner function Pf(x,p) has already 
been shown not to be everywhere non-negative. My conjecture is refuted. 

One remark before I take leave of this topic: Richard Crandall 33 has drawn 
attention to a wonderful property shared by the quantum potentials 

Un(x) = -^N(N + 1) sinh 2 (I) : AT = 1,2,3,... 

32 See classical field theory ( 1979 ), p. 55. 

33 R. E. Crandall & B. R. Litt, “Reassembly and time advance in rcflectionless 
scattering,” Ann. of Phys. 146, 458 (1983); R. E. Crandall, “Exact propagator 
for reflectionless potentials,” J. Phys. A 16, 3005 (1983). 
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of which we have encountered the leading instance (N = 1). Citing Morse & 
Feshbach , 34 Crandall remarks that Un(x ) admits of exactly N bound states, 
and that the energies associated with those states can be described 

E N ,n = -^(N-n) 2 : n = 0 , 1 ,..., N - 1 

For the system considered in connection with my ill-fated conjecture there is, 
on these grounds, a single bound state, which is perforce the ground state: 
no allusion to a “nodal folk theorem” is in fact required. It was, by the way, 
Crandall’s interest in this problem which stimulated me to look into some of 
the associated classical mechanics . 35 

12. Fat Gaussians are mixtures. For the purposes of this discussion I adopt a 
notational refinement: we agree to 

write <j and A when the minimality condition crA = \h is in force 
write u and A otherwise 

We note that it is always possible to achieve minimality by rescaling ; i.e., to 
write 

a = b<7 and A = bX : b > 0 is the “fatness parameter” 

Look now again to the familiar bivariate normal distribution function 

PI * X ’P) = F537 ex p{-IK]’-111]’} (™) 

= pj-ibf-iKl 2 } 

which will satisfy the quantum mechanically enforced boundedness condition 
(29.3) if and only if b > 1 (which we henceforth assume to be the case), but 
was seen at (59) to be the Wigner transform of a wavefunction ^>( x ) only in 
the case b = 1. The question now before us - first posed at the end of §9—is 
this: What quantum mechanical meaning can be assigned to the “fat Gaussian” 
distributions which arise from (70) when b > 1? 

Introduce dimensionless variables 

x = x/a and p = p/X 

and notice that 

{x/a) 2 + (p/A ) 2 = x 2 + p 2 = constant 

inscribes an ellipse on the {a:,p}-plane, but a circle on the {i:,p}-plane. 
Introduce polar coordinates 

x = r cos 6 
p = r sin 9 

34 Methods of Theoretical Physics ( 1953 ), p. 1654. 

35 CLASSICAL MECHANICS ( 1983 ), Appendix C. 
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and notice that (70) can be written 

P(x,p) = l exp{- ^r 2 } 

Finally, introduce 


z = r 2 


to obtain 


P(x,p) = p- fe 262 ; 


(71) 


and notice (see again (61)) that one recovers 

I 


= oscillator ground state 22 at 6 J, 1 


Evidently dxdp = crA • d£’dp = §rdrdd = jdzdO , in consequence of which we 


recover 


P(x,p) dxdp = 27t / e w?~dz = 1 

Jo 


So much by way of preparation. 

Look now to the Wigner functions 


P„(cr,p) = (-)"£ e~* x L n (z) (72) 

which were at (61) directed to our attention by—of all things! -the quantum 
theory of oscillators. Notice that 

J j Pm{x,p)P n {x,p) dxdp = (-) m+ "d) 2 f 27 T J e~ z L m (z)L n (z) dz 


= id 

h rnn 


(73) 


by the orthogonality of the Laguerre polynomials. 36 The idea now (see again 
the end of §9) is to write 

P{x,p) = ^p n P„(a;,p) 

n 

Pn = h f f P(x,p)P n (x,p) dxdp 


which, when P(x,p) is given by (71), becomes (after simplifications) 

fat Gaussian = E p n P n (x,p) (74) 

n 

Pn = (-) n ^r [ e~^^ 1 + ^ z L n (z)dz 

Jo 

36 For a nice account of the properties of these polynomials see Chapter 23 
of J. Spanier & K. B. Oldham, An Atlas of Functions ( 1987 ). 
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The integral is tabulated , 37 and gives 

^ = pMferr (75) 

concerning which several elementary remarks (the net effect of which is to 
inspire confidence in the accuracy of (75)) are in order: 

p n = 1 : all values of k 

n 

p n are non-negative for all n if and only if b > 1 

if b = 1 then po = 1 and all other p n vanish 

The second of the preceding points merits special comment: if b < 1 were 
allowed, then not only would we stand in violation of the uncertainty principle, 
we would acquire an obligaton to try to make sense of “negative statistical 
weights” in the sense of ordinary probability theory —a notion for which even 
quantum mechanics makes no provision. 

I conclude this discussion with miscellaneous comments of a more general 
nature: 

The issue before us has been pre-dynamical: we have been concerned with 
the quantum mechanical interpretation of Wigner distributions of a certain (“fat 
Gaussian”) design; we have not been concerned with the dynamical motion/flow 
of such a distribution; no Hamiltonian has been specified. Reference to 
“oscillators” is relevant only to this extent: the conditions 

P(x,p) = constant and H osc (x,p) = E 

give rise to identical populations of (elliptical) curves. It would cut closer to 
the analytical heart of the matter to observe (say) that 38 

n 

53 ( n k )H 2k (x)H 2n _ 2k (y) = (—) n n\L n (x 2 + y 2 ) 
k -0 


and that the Hermit polynomials H n (x ) = (— ) n e x2 (^)"e * 2 —made familiar 
to us by the quantum theory of oscillators—are in all respects very “Gaussian.” 

37 See Gradshteyn & R.yzhik, 7.414.5, where we are informed that 
e~ az L n (z) dz = (a — l) n a~ n ~ 1 : K[a] > 0 

38 See A. Erdelyi et al. Higher Transcendental Functions II ( 1953 ), 10.13.32. 
The identity quoted is intended merely to typify the intricately patterned 
relationships which link the Hermite with Laguerre polynomials. 
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Straightforward extension of the method described above permits one to 
accomplish the “spectral analysis” (not just of “fat Gaussians” but) of any 
Wigner function of the specialized form 

P{x,p) = f{z) with 2 = z(x,p) = ( x/a) 2 + (p/ A) 2 

But it leaves us still powerless to attack the general problem posed at the end 
of §9—powerless to discover weights p n and functions P n (x,p) which permit 
one to write 

p (x,p) = ^ p n P n {x,p) 

n 

when the Wigner function P(x,p) is arbitrary. 

The question arises: How—physically—would one prepare the mixture of 
states to which a “fat Gaussian” has been found to refer? One thinks naturally 
of “simultaneous x and p measurements of less-than-optimal precision,” but 
(in the absence of a well worked out phase space formulation of the quantum 
theory of measurement) I see no way to make that idea precise. I describe an 
alternative procedure which does in fact hinge upon the quantum physics of 
oscillators. Considering a (whence also A = H/2a) to have been given, select 
an oscillator in which the mass and frequency have been “tuned” in such a way 
as to achieve 


H(x,p) = 2 bP 2 + = j^{(a:/(j) 2 + (p/A) 2 } (76) 

s -V-' 

z 


Select an oscillator from a thermalized population of such oscillators. It will be 
in state |n) with probability 


Pn = 


JL p-0(n+h) 


(3 = 


huj 

kT 


and since the partition function 


Z = ^V^+5) 

n=0 


e 2 ^ 

1 — e - 0 


we have 

Pn = (1 - e~ /J ) [e^] 

which if we set 



i. e., 


b 2 


l+e^ 

1 - e -/ 3 


coth ( \ f3 ) 


(77) 


is found by quick algebra to possess precisely the structure of (75)! That “fat 
Gaussians” lurk within the thermal physics of a quantum oscillator, and become 
minimally dispersed ground state Gaussians as T J. 0, is a result which I find 
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quite satisfying—made the more so by the following observation: Classically we 
expect to have \kT = E = imw 2 (amplitude) 2 , giving 

(amplitude ) 2 = ^ (78.1) 

On the other hand, it was remarked already on p. 28 and follows from (76) that 
cr 2 = h/2mu, while (77) entails 



~ 2 jrk for temperatures of “classical magnitude” kT » hio 

so we have 

* 2 = (H 2 - & (78.2) 

The point to notice is that if we accept the identification 

a <—» amplitude 

then (78.1) and (78.2) say precisely the same thing Z 39 

The idea now is to adjust the temperature T so as to assign the desired 
value to the “fatness constant” b and then, at time t = 0 , to “snip the spring.” 
If T = 0 then 6=1 and we end up with a Gaussian of minimal dispersion (pure 
ground state), but if T > 0 (which entails 6 > 1) we produce a thermal mixture 
of oscillator states, represented within the phase space formalism by a Wigner 
function of “fat Gaussian” design... which then moves off under the dynamical 
control of whatever Hamiltonian we have build into our apparatus (i.e., which 
we abruptly turn on when we snipped the spring; the default Hamiltonian is 
would be that of a free particle). 

13. “Entropy” of a Gaussian wavepacket. It was Boltzmann—working during 
the decade prior to 1877 toward a kinetic theoretic interpretation of the entropy 
concept—who first drew the attention of physicists to the importance of the 
construction 

H = - J^Pn^OgPn = “(log p n ) 

in connection with which it is indispensable to notice that (obviously, else by 
l’Hopital’s Rule) 

lim p log p = lim p log p = 0 

pT 1 pfo 

And it was Claude Shannon ( 1948 ) who first appreciated that Boltzmann’s 
construction possesses an abstract significance far deeper, and a utility far more 
diverse, than that originally contemplated. 


39 p or re lated discussion see §30 in L. D. Landau & E. M. Lifshitz, Statistical 
Physics ( 1958 ). 
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Quantum mechanics—which is nothing if not an intensely probabilistic 
subject—presents an abundance of distribution functions of various types. One 
might therefore expect importance to attach to expressions of (say) the form 

H bP] = ~J \if{x,t)\ 2 \og\if{x,t )\ 2 dx (79) 

and is surprised to discover that the notion thus expressed appears to be—or 
at least until quite recently have been 40 —absent from the quantum mechanical 
literature. Within the phase space formalism one is similarly inspired to write 

H\P]^-jjP(^otP(x, K t)d X d P (80) 

which, though absent from the quantum mechanical literature, is formally 
identical to an equation standard to classical statistical mechanics : 41 at thermal 
equilibrium one expects to have P(x,p) = Ze~Tpp E ^ x,p ^ which gives 

H[P\ = ^r(E) + log z 

and by a celebrated line of argument 42 discovers that the expression on the 
right admits of the interpretation 

entropy 
= k 

It is in allusion to this fact that in modern literature 43 one encounters, even in 
contexts which have nothing to do with thermodynamics, the terminology 


— logP(event) = “event-information” 

H[P] = — PlogP = — (logP) = mean event-information 
events = “entropy” 

40 I will have occasion later to cite papers by C. Adami and others, which 
began to appear during the mid-iggo’s. 

41 Quantum statistical mechanics is nearly as old as quantum mechanics itself 
(arguably older, since it was a statistical mechanical problem that inspired 
Planck), and already by ig 27 it had been pointed out by J. von Neumann 
(in his Mathematische Begriindung der Quantaenmechanik ; see p. 394 in the 
English translation) that in thermalized situations it makes sense to write 

p=^e~ wrH with Z = tre~ wrH 


and to set 

thermodynamic entropy = —/ctrplogp 

and it was to facilitate interpretion of these statements that Wigner was led 
to the invention of the Wigner transform. So when I say “absent from the 
quantum literature” I should emphasize that I mean absent in contexts which 
make no reference to thermal equalibrium. 

42 See statistical PHYSICS (ig 6 g), Chapter 3, pp. 35-38. 

43 See, for example, “213.B, Information Theory” in Encyclopedic Dictionary 
of Mathematics (igg3). 
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H[P\ is by nature a real number-valued functional of the distribution P(event). 

The relative unimportance of (79) can, I suppose, be accounted for as 
follows: write ip = Ae^ s . Then (79) reads H[ip] = — f \A\ 2 log \A\ 2 dx, from 
which all reference to the phase factor S has dropped away. But it is within 
that phase factor—closely related to the classical action—that much of the 
most characteristically quantum mechanical “good stuff” is known to reside. 
The dismissive force of that remark evaporates when one turns from (79) to 
(80), for folded into the functional structure of P,p(x,p: t), is all the physically 
significant information conveyed by both A{x,t) and S(x,t ); this follows from 
the invertibility of the Wigner transform (Beck’s formula). 

On another occasion I will look (within the phase space formalism) to 
some of the more general quantum applications of the entropy concept; here 
my intention is simply to plant seeds for future harvest, and to examine entropic 
aspects of the Gaussian wavepackets which are at present my major concern 

It serves my present needs to adopt a notational refinement of (1): 

G(x;a) =e"{ 2 [ ^] 2 + lo gOv^)} 

= centered Gaussian distribution with variance a 2 


Evidently 


H [G] = 5 { 1 + log(27r)} + log cr 
= 1.41894 + log cr 


(81) 


For purposes of comparison, consider the flat “particle-in-a-box distribution” 


b(x; a) = \ 2 ia 


x 2 < a 2 

outside that interval 


Obviously (x) = 0, and by quick calculation (x 2 ) = |a 2 , so 

B(x\ a) = b(x; aV 3) = centered box distribution with variance cr 2 
We compute 


H[B] = 


r+a 


h lo g In dx = lo g( 2a ) 


jy /3 


= log(2V3) + log (7 
= 1.24245 +logo- < H[G) 


all values of a 


Or consider this distribution, borrowed from discussion subsequent to (62) in 

§ 10 : 

F ( x ^) = [^ se ch 2 (S)l 

J Ot=y/ 12/'K z G 

= centered sech 2 distribution with variance cr 2 
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We compute 


H[F] 


i sech 2 (g) log sech(g) dx + log( 2 a) 


J ot=^J 12/'k <2 (t 


= 2 - 2 log 2 + log( 2 y/l 2 / 7 r 2 ) + log a 
= 1.40458 + log er < i? [G] : all values of a 


The preceding inequalities are illustrative of a general inequality, which emerges 
from the solution of this problem: 


r <*°> = i 

maximize H[p(x)\ subject to the constraints < (x 1 ) = 0 

1 (x 2 ) = a 2 

Using the method of Lagrange multipliers to manage the constraints, we are 
led 44 to write 6 J { p log p + a + bx + cx 2 } dx = 0 whence 


logp + (1 + a) + bx + cx 2 } dp = 0 


giving 

p(x) = e ~U 1 + a '>+ b *+™ 2 } 

and we find ourselves obligated by the constaints to set c = b = 0 and 
e -d+a) _ —i . i_ e reproduce (1). The proposition 45 that 

(TV 27T 

Among the distributions p(x) of specified variance, 
entropy H[p(x)\ is maximized at the Gaussian 

lends the Gaussian an importance which (it seems to me) cuts much deeper 
than mere “analytical tractability;” it is on these grounds not a “distribution 
among equals” but by birthright entitled to a little occasional “tyranny.” 

Returning with (81) to (13), we see that the standing motion of a Gaussian 
wavepacket lends the associated entropy a time-dependence which can be 
described 


H[\if(x,t)\ 2 } = i{l+ log(27r)} +logcr(f) (82.1) 

<r(t) = osJ\ + (t/r) 2 

44 My analytical method is a variant of one devised by Boltzmann to extract 
the Maxwell distribution from the H-theorem; see D. ter Haar, Elements of 
Statistical Mechanics ( 1954 ), p. 20, or Chapter 2, p. 34 in statistical physics 
( 1969 ). Also Appendix 14 in Max Born’s Natural Philosophy of Cause and 
Chance ( 1949 ). 

45 Albyn Jones assures me that this result is “reasonably well known,” but 
could suggest no published source, nor have I attempted to locate one. 
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and (asymptotically) grows as logt as the packet becomes spatially dispersed. 
In the momentum representation (see again (9) and (5)) one on the other hand 
obtains 

H \W{P,i)\ 2 ] = H 1 + 1 °g( 27r )} + lo S A (82.2) 

which is constant in time. 

The phase space formalism leads to conclusions which are relatively more 
interesting, and made so because they embrace all of the “missing physics;” it 
looks, after all, not to marginal distributions but to the underlying joint (quasi-) 
distribution. In the case of a “freely moving launched Gaussian wavepacket” 
we at (30.2) had 

P^(x,p;t)= y^expj-i[f] 2 } 

(here as before: A = fi/2er, 9 = t/r , X = x — vt and P = p = mv ) and if we 
allow ourselves to work from (80) obtain 

H[P*]= 11 - 9%] 2 + ±[%] 2 + log(a\2n)} dXdP 

= M X2 ) - M XP ) + P 2 > + log(aA2^) 

= - - 9 2 + + log(crA27r) 

= 1 + log(crA27r) (83) 

The structure of (83) inspires the following series of comments, which will in 
turn motivate some adjustments, and open some doors: 

I draw attention first to the fact that at (80)—as previously also at (79)— 
we committed a major breach of etiquette: we allowed ourselves to form the 
logarithm of an expression which bears physical dimension. The logarithm is 
uniquely forgiving of such indiscretions, but our faux pas should not be allowed 
to stand uncorrected. Whether one argues from the defintion (53) of the Wigner 
transform or from ff P(x,p ) dxdp = 1, it is clear that 

dimP^(z,p) = ^ 

But it is an implication of the familiar boundedness condition (29.3) (i.e., of 
the Heisenberg uncertainty principle) that 

footprint of most compact Wigner function = | 

and it becomes therefore natural to write 

1 = JJ P(x, p) dxdp = J jp{x , p) 

P(x,P) = %P(x,P) 
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This obvious variant of (80) H[P) = —ff PlogPdfl (I adopt here the fairly 
standard abbreviation dfl = j^dxdp) is free of the defect to which I have drawn 
attention, 46 and entails 


H[P ] = H[P ] — log(/i/2) 

Thus are we led, in place of (83), to write 

H [^gaussian ] = 1 + log(erA27r) - \og(h/2) 

= 1 because ctA27t = h/2 


(84) 


(85) 


To show that we stand now in the presence of a contraction I have now 
to digress a bit: Let \ip n ) refer to some/any set of orthonormal states, and let 
p„ = IV’nXV’nl refer to the associated set of associated projection operators. 
One has 

Pn = Pn ; PmPn = 0 (lll/n) J £ P„ = 1 (86.1) 

where the former statements imply trace-wise orthonormality 

PmPn = firnn (86.2) 

and where the latter says simply that the {|Xn)} are complete in whatever space 
they span, whether infinite or finite dimensional. Let £ a n p n be some/any 
linear combination of projection operators, and let /(•) be some/any formal 
power series in a single variable; it follows formally from ( 86 . 1 ) that 47 

/(£«nP„) = £/(On) Pn (87) 

Let the density matrix p = £ p n p r , refer to some/any statistically weighted 
mixture of such states. It is (formally) an implication of (87) that 

e £(logPn)p„ (8g) 

which lends meaning in orthonormal cases to the expression “log p.” We find 
ourselves therefore in position to write 

H\p] = -tr(plogp) (89.1) 

= -tr { l °SPn) PmPn} 

m,n 

= -£Pnl0gPn (89.2) 

46 It is interesting to note that (79), in the absence of a generally available 
“natural length,” admits of no such adjustment. 

47 The following statement is more familiar as encountered within the theory 
of diagonal matrices. 
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We conclude that the entropy of a mixed state has nothing to do with “quantum 
statistics,” but everything to do with the mixture statistics. The entropy of a 
mixture is maximal when 48 

p n = T with N = tr 1 = dimension of spanned space : all n 
in which case one has 

-H>]max = logiV 

The entropy becomes minimal in the “pure case” p 2 = p\ i.e.; when one of the 
p n is unity and the rest vanish, in which case one has 

H[p] mi „ = 0 (90) 

which is contradicted by (85). The force of the contradiction is, however, 
blunted by the circumstance that ( 88 ) becomes strictly meaningless in the pure 
case, and the accuracy of (89.2) is to that extent compromised. Let us look 
therefore to a case in which none of the p n vanish—the case of a thermalized 
oscillator. 

We observed already near the end of §12 that in for a thermalized quantum 
oscillator one has 

Z = [2sinhg/3] 1 and p n = (1 — e~ 0 )e~ n0 
with (3 = hio/kT. Working from (89.2) we are led therefore to write 

Hose = 53(1 - e~ 0 )e~ n0 { - log(l - e~ 0 ) + n(3) 

= log(l - e~ 0 ) + (1 - e~ 0 ) ( - &%) 53 e~ n0 
= — l°g(l — e ~ 0 ) + P/(e 0 — 1) (91) 

which is readily brought to a form 

= — log 2sinh|/3 + i/3coth ^/3 

which a thermodynamicist would have extracted more swiftly from 

= { l -P%} X °Z Z 

We observe that (91) does conform to (90) in the limit /3 T oo (i.e., as T j 0). 
We know,on the other hand, that a thermalized mixture of quantum states gives 
rise within the phase space formalism to the “fat Gaussian” 

p (x,p;t) = ^cx P {- - e i} 2 - i[w] 2 } 

48 For the detailed argument see A. I. Khinchin, Mathematical Foundations 
of Information Theory ( 1957 ), p. 4. 
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and by trivial adjustment (send a > <r = bo and A i—> A = bX) of the calculation 
which gave (83) we have 

H [-^fat gaussian mixture ] ^ A log(<rA27r) log(/l/ 2 ) 

= 1 + log b 2 
= 1 + log C0tll?/4 

= 1 + log {(1 + e -/3 )/(l — e _/3 )} (92) 

It is gratifying to observe in reference to (85) that 


H[P( a t gaussian mixture ] > H Impure gaussian ], wi * h equality in the limit (3 ] OO 


but the limit does not conform to (90), and -which is more to the point—the 
right sides of (91) and (92) are distinct. How did that come to be so? 

The answer has to do with some fundamentals of the Weyl correspondence: 

from 


P{x,p) 


- P 

Weyl 


it does not follow that 


P{x,p) \ogP(x,p) < -—-- plogp 

Weyl 

because more generally 

A(x,p) < -A does not imply f(A(x,p )) <-/(A) (93) 

Weyl v 7 Weyl 

That (invariably) 

Weyl transform of sum = sum of Weyl transforms (94.1) 

but (except in some special cases) 

Weyl transform of product ^ product of Weyl transforms (94.2) 
is seen from 49 

to follow from the corresponding elementary properties of the trace. The 
circumstance highlighted at (93) has sometimes been held (by von Neumann and 
others: see J. R. Shewell, “On the formation of quantum-mechanical operators,” 

49 The following description of the inverse Weyl transform —which can, by the 
way, be made the basis of an alternative derivation of Beck’s formula (54)—can 
be extracted from material presented on pp. 101-103 of QUANTUM MECHANICS 
( 1967 ), Chapter 3. 
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AJP 27, 16 (1959)) to comprise a defect of the Weyl correspondence procedure, 
though I fail to see why; starting from the classical Hamiltonian, one has 

H(x,p) -> H does not imply <-e~ /3H 

Weyl ‘ Weyl 

and recognizes that if it were otherwise then quantum mechanics and classical 
statistical mechanics would be the same theory! Standardly, one gives meaning 
to e~ ,3H either by appeal to the spectral resolution of H 

H = E n \ri)(n\ => e _/3H = X) e_/3E,l l n )( n l 

or (in favorable cases) by appeal to the methods of operator algebra, or by 
appeal to Feynman’s sum-over-paths technique. The former procedure, when 
brought to bear upon plogp places one in position to exploit the additive 
property (94.1) of the Weyl transform; writing 

P = J2Pn\n)(n\ — » plogp= EPnlogp„|n)(n| 

Weyl/Wigner 

= J2Pn PnPn{x,p) 

we would be led back again to precisely (89.2), and to the conclusion that “the 
entropy of a mixed state has... to do with the mixture statistics.” 

We conclude that (89) and (80) speak about different things: the former 
alludes (in quantum dress) to the familiar stuff of ordinary statistical mechanics, 
while the latter presents us with an opportunity to say something new. To 
emphasize the peculiarly quantum mechanical character of that “something 
new” I will henceforth write 


H q [P] = -Jj P(x,p)\ogP(x,p)dxdp (96) 

where the subscript is intended to suggest “quantum.” The expression on the 
right side of (96) is recommended to our attention with the same force, and for 
the same function-theoretic reasons, that serve invariably—in all contexts—to 
recommend (logP). But I chose my phraseology with care when I said that 
H q [P] is U peculiarly quantum mechanical;” P(x,p) refers in (96) to a Wigner 
function (or “quasi-distribution”), and Wigner functions can assume negative 
values, with the result that 

H q [P] will, in general, be complex 
From the elementary observation that 


P(x,p) = 


\P(x,p)\ at points where P(x,p) > 0 
e m \P(x,p)\ at points where P(x,p) < 0 
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we conclude that 


H q [P] 


P(x,p) log \P(x,p)\ dxdp — in 


P(x,p) dxdp 


neg 


(97) 


where the first Jf ranges over all of phase space, while the second ranges only 
over the domain on which P(x,p) < 0. 

To gain some feeling for what (97) has to say in concrete cases, we look 
again to the Wigner functions (72) of a harmonic oscillator. Borrowing some 
computational technique from §12, we for the ground state have 

H q [P,'o]=~l[ { e_ ^ z }{~ \ z \ dz ~ \ [ |e _ 5 z |dz 

Jo ^ J J J empty L J 

= l + i0 

which simply reproduces (85). For the first excited state (for which the Wigner 
function becomes negative on the interval 0 < z < 1) we have 

H q [P 1 ] = -J j j — e“5 2 [l — 2 ] }{ — \z + log [l — z] ^dz 

{-e-^[l-z\}{-\z + \og [z~l\}dz 

-e-5 2 [l -z\}dz 

/ OO 

| — e - ^ 2 [l — ^] j log [z-l]dz 

where the numerics have been obtained by evaluation of expressions which 
Mathematica generated with ease, though it found the final integral to be 
“intractable.” A simple change of variable leads, however, to an elementary 
Laplace transform 

/ oo . . roo 

{e-**[*-l]}log[* —l]d*= ^ e- sv y\ogydz ^ 

_ 1 r l—7—log » 1 

2y/e i s 2 Js=j 

= 1.35369 

so we have H q [Pi] = 1.55100 — 0.60935 i, and the emergence of no evident 
pattern. The evaluation of H q [P 2 \ promises to be even more tedious, and I see 
no way to adapt generating function techniques to the evaluation H q {P n \. The 
moral appears to be that one can expect i7 9 [P]-evaluation to be difficult. Nor 
is this surprising; it is merely a particular manifestation of a pervasive fact: In 
thermalized situations, entropy can be extracted from the partition function— 
thermodynamics supplies the general procedure—where all computational 
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difficulties come into convenient focus (“to know Z is to know everything”). 
But in non-thermalized contexts (i.e., the absence of a partition function) one 
enjoys no such advantage, and can expect entropy calculations almost always 
to be difficult. It is into just such a context that H q [P] plunges us, for (in 
the absence of a successful hidden-variable theory) we certainly do not expect 
the design of P^(x,p) to have anything to do with any interpretation of the 
“thermalization” concept. 



